AITopics | open-ended text generation

Collaborating Authors

open-ended text generation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Breaking the Likelihood Trap: Variance-Calibrated Modulation for Large Language Model Decoding

Ding, Yuanhao, Li, Meimingwei, Arias, Esteban Garces, Aßenmacher, Matthias, Heumann, Christian, Zhang, Chongsheng

arXiv.org Machine LearningJun-23-2026

In open-ended generation, LLMs frequently fall into the "likelihood trap", marked by repetitive degeneration and vocabulary dullness, creating a discrepancy between machine-generated and human-written text. While post-hoc tail truncation (e.g., Top-$p$, Min-$p$) avoids sampling from the unreliable tail, it can over-sample from the uncalibrated head and misalign generation with human lexical preferences; fixed scalar repetition penalties likewise ignore variation in logit scale across inference steps, potentially disrupting semantic coherence. To address both limitations, we propose Variance-Calibrated Modulation (VCM), a training-free pre-decoding intervention that reshapes the probability distribution before truncation through two dynamic mechanisms: (1) Contextual Searchlight via PMI, which suppresses global stopwords while elevating context-evoked tokens, and (2) Adaptive Self-Debiasing, which uses real-time logit standard deviation for scale-invariant penalization. Across open-ended generation, factual QA, and mathematical reasoning, VCM consistently mitigates the likelihood trap. With negligible computational overhead, VCM integrates with existing decoding strategies, improving diversity, coherence, and, particularly at higher decoding temperatures, reasoning accuracy.

computational linguistic, large language model, natural language, (18 more...)

arXiv.org Machine Learning

2606.22511

Country:

Asia > Middle East > UAE (0.46)
North America > United States (0.46)
Europe > Austria (0.28)
Europe > Germany (0.28)

Genre: Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment > Sports (1.00)
Education (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation

Neural Information Processing SystemsJun-11-2026, 08:07:34 GMT

Large language models (LLMs), despite their impressive performance across a wide range of tasks, often struggle to balance two competing objectives in open-ended text generation: fostering diversity and creativity while preserving logical coherence. Existing truncated sampling techniques, including temperature scaling, top- (nucleus) sampling, and min-sampling, aim to manage this trade-off.

artificial intelligence, large language model, natural language, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.59)

Add feedback

df438caa36714f69277daa92d608dd63-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 09:31:42 GMT

arxiv preprint arxiv, factuality, knowledge, (13 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Illinois (0.04)
(3 more...)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)

Add feedback

Factuality Enhanced Language Models for Open-Ended Text Generation

Neural Information Processing SystemsDec-25-2025, 12:26:47 GMT

Pretrained language models (LMs) are susceptible to generate text with nonfactual information. In this work, we measure and improve the factual accuracy of large-scale LMs for open-ended text generation. We design the FactualityPrompts test set and metrics to measure the factuality of LM generations. Based on that, we study the factual accuracy of LMs with parameter sizes ranging from 126M to 530B. Interestingly, we find that larger LMs are more factual than smaller ones, although a previous study suggests that larger LMs can be less truthful in terms of misconceptions. In addition, popular sampling algorithms (e.g., top-p) in open-ended text generation can harm the factuality due to the ``uniform randomness'' introduced at every sampling step. We propose the factual-nucleus sampling algorithm that dynamically adapts the randomness to improve the factuality of generation while maintaining quality. Furthermore, we analyze the inefficiencies of the standard training method in learning correct associations between entities from factual text corpus (e.g., Wikipedia). We propose a factuality-enhanced training method that uses TopicPrefix for better awareness of facts and sentence completion as the training objective, which can vastly reduce the factual errors.

factuality enhanced language model, name change, open-ended text generation, (7 more...)

Neural Information Processing Systems

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.07)

Genre: Instructional Material (0.60)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Neural Information Processing SystemsDec-23-2025, 21:52:17 GMT

As major progress is made in open-ended text generation, measuring how close machine-generated text is to human language remains a critical open problem. We introduce Mauve, a comparison measure for open-ended text generation, which directly compares the learnt distribution from a text generation model to the distribution of human-written text using divergence frontiers.

divergence frontier, name change, neural text and human text, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.94)

Add feedback

In-Distribution Steering: Balancing Control and Coherence in Language Model Generation

Vogels, Arthur, Wong, Benjamin, Choho, Yann, Blangero, Annabelle, Bhan, Milan

arXiv.org Artificial IntelligenceOct-16-2025

Activation steering methods control large language model (LLM) behavior by modifying internal activations at inference time. However, most existing activation steering methods rely on a fixed steering strength, leading to either insufficient control or unadapted intervention that degrades text plausibility and coherence. We introduce In-Distribution Steering (IDS), a novel method that adapts steering strength based on the input data distribution in representation space. IDS dynamically adjusts interventions according to how far a given input lies within the distribution, enabling adaptive intervention and generation stability during text generation. Experiments demonstrate that IDS achieves strong accuracy on classification tasks while producing coherent text without collapse, making IDS particularly well suited for real-world applications.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2510.13285

Country: North America > United States (0.46)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs

Goel, Raghavv, Agrawal, Sudhanshu, Gagrani, Mukul, Park, Junyoung, Zao, Yifan, Zhang, He, Liu, Tian, Yang, Yiping, Yuan, Xin, Lu, Jiuyan, Lott, Chris, Lee, Mingu

arXiv.org Artificial IntelligenceSep-30-2025

In this paper, we introduce a simple training-free technique to improve the performance of drafter-based speculative decoding (SpD) methods that incorporates language modeling head (LM head) during drafting process. A drafter-based speculative decoding leverages one or more smaller language models, a.k.a. drafters or draft models, to sample a draft sequence or tree consisting of multiple tokens, followed by verification by a base LLM, a target model, accepting a subset as its valid generation. As it is usually considered that the speculative decoding requires one-to-one mapping between vocabularies of the target model and the draft model, it has been natural to share the vocabulary between them, or even share the LM head as in EAGLE or Medusa. We first identify that this draft token sampling scheme inherently contains an unnecessary inference overhead in drafting, especially for some target LLMs with very large vocabularies. Then, we propose a simple technique, VocabTrim, to mitigate the drafting overhead to improve the generation speed in memory-bound environment. VocabTrim reconstructs the drafter LM head to contain only a limited set of tokens, selected by the most frequently sampled from the vocabulary of the target model. While limiting the vocabulary in drafting slightly degrades the acceptance rate, it significantly reduces the drafting latency in memory-bound process which is often the case on edge devices, resulting in higher memory-bound speed up (MBSU). We show that our method can boost the memory-bound speed-up for Llama-3 models on Spec-Bench, specifically by 16% for Llama-3.2-3B-Instruct.

drafter, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2506.22694

Country: North America > United States (0.24)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Factuality Enhanced Language Models for Open-Ended Text Generation

Neural Information Processing SystemsAug-19-2025, 12:23:49 GMT

Pretrained language models (LMs) are susceptible to generate text with nonfac-tual information. In this work, we measure and improve the factual accuracy of large-scale LMs for open-ended text generation.

arxiv preprint arxiv, large language model, natural language, (16 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Illinois (0.04)
(3 more...)

Genre: Research Report > New Finding (0.93)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)

Add feedback

Filters

Collaborating Authors

open-ended text generation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Breaking the Likelihood Trap: Variance-Calibrated Modulation for Large Language Model Decoding

Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation

MAUVE_Evaluating_Open_Ended_Text_Generation(4)

df438caa36714f69277daa92d608dd63-Paper-Conference.pdf

MAUVE_Evaluating_Open_Ended_Text_Generation(4)

Factuality Enhanced Language Models for Open-Ended Text Generation

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

In-Distribution Steering: Balancing Control and Coherence in Language Model Generation

VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs

Factuality Enhanced Language Models for Open-Ended Text Generation